Implementation of float-float operators on graphics hardware
نویسندگان
چکیده
The Graphic Processing Unit (GPU) has evolved into a powerful and flexible processor. The latest graphic processors provide fully programmable vertex and pixel processing units that support vector operations up to single floating-point precision. This computational power is now being used for general-purpose computations. However, some applications require higher precision than single precision. This paper describes the emulation of a 44-bit floating-point number format and its corresponding operations. An implementation is presented along with performance and accuracy results.
منابع مشابه
Stream Processing in Global Illumination
This paper presents the implementation of the stochastic radiosity algorithm on the graphics hardware. We store the radiosity function in texels of the floating point pbuffer. The radiosity function is updated in each iteration. When converged, the radiosity function is mapped onto surfaces by traditional texture mapping. Our goal is to enable interactive radiosity style rendering of scenes wit...
متن کاملImplementation of an Efficient Conjugate Gradient Algorithm for Poisson Solutions on Graphics Processors
Programmable graphics processors have achieved the distinction of being very efficient and cost-effective in terms of floating-point capacity, thereby making it an attractive option for scientific computing. In this paper, we discuss the implementation of the Conjugate Gradient iterative solver on a graphics processor. A Poisson equation is solved with the graphics processor on an unstructured ...
متن کاملImplementation of the reciprocal square root in MPFR
We describe the implementation of the reciprocal square root — also called inverse square root — as a native function in the MPFR library. The difficulty is to implement Newton’s iteration for the reciprocal square root on top’s of GNU MP’s mpn layer, while guaranteeing a rigorous 1/2 ulp bound on the roundoff error. The reciprocal square root is an important function in 3D graphics, for the no...
متن کاملThe 2d Discrete Wavelet Transform on Programmable Graphics Hardware
The growing popularity of the Discrete Wavelet Transform (DWT) has boosted its tuning on all sorts of computer systems, from special purpose hardware for embedded systems to general purpose microprocessors and multiprocessors. In this paper we continue to investigate possibilities for the implementation of the DWT, focusing on state-of-the-art programmable graphics hardware. Current design tren...
متن کاملImplementing Performance Libraries on Graphics Hardware
We propose a simple method to implement floating-point vector math operations and matrix multiplication on graphics hardware, focusing on identification of details, in both software and hardware, which affect performance and ease of use. Before widespread adoption of the graphics processing unit (GPU) as another computation processor, we must address the need of application interfaces (APIs) th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/cs/0603115 شماره
صفحات -
تاریخ انتشار 2006